Similar resources
Audio-Visual Speech Cue Combination
BACKGROUND: Different sources of sensory information can interact, often shaping what we think we have seen or heard. This can enhance the precision of perceptual decisions relative to those made on the basis of a single source of information. From a computational perspective, there are multiple reasons why this might happen, and each predicts a different degree of enhanced precision. Relatively...
Audio-visual speech recognition is consistent with Bayesian optimal cue combination
In the AV* condition, we intended to decrease the amount of information provided by the visual stimulus while preserving as much as possible the appearance of the talking face (see Figure S1). A natural appearance of the visual stimulus has been found to be important for effective AV fusion of speech (Schwartz JL et al., 2004). To this end, we used an “Active Appearance Model” (AAM) – a compute...
Continuous Audio-visual Speech Recognition
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...
Expressive audio-visual speech
We aim to realize an Embodied Conversational Agent able to interact naturally and emotionally with the user. In particular, the agent should behave expressively. Specifying only the facial expression that corresponds to a given emotion will not produce the sensation of expressivity. To do so, one needs to specify parameters such as intensity, tension, and movement properties. Moreover, emotion affec...
Audio Visual Speech Enhancement
This thesis presents a novel approach to speech enhancement by exploiting the bimodality of speech production and the correlation that exists between audio and visual speech information. An analysis into the correlation of a range of audio and visual features reveals significant correlation to exist between visual speech features and audio filterbank features. The amount of correlation was also...
Journal
Journal title: PLoS ONE
Year: 2010
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0010217